Subject analysis and indexing : from automated indexing to domain analysis
نویسنده
چکیده
Preparing an index to a book or assigning indexing terms to documents involves the task of subject analy sis. Subject analysis can be defined as "the intellectual or automated process by which the subjects of a docu ment are analyzed for subsequent expression in the form of subject data'.1 Is automatic indexing a chal lenge, or even a threat to indexers? What do we mean when we talk about "subjects' of books and other documents? Are there different conceptions of subjects, and hence of subject analysis; and if so, are such con ceptions interconnected with methods applied for indexing? These questions are important because discussions of indexing tend to focus on the performance of auto mated indexing2 versus the performance of human indexing,' usually based on relevance judgments com piled from particular groups of IR-system users,4 or observations of interindexer consistency.Using perfor mance as a criterion for finding a 'winning' approach to subject analysis and indexing implies a model which restricts itself to comparing the two techniques. Based on empirical observations of user and indexer behav iour, it represents a mechanistic evaluation method for subject assignment and retrieval. This article presents an alternative model for discussing subject analysis and indexing, where the choice between particular methods applied for this task is of minor importance. The inten tion is to attempt to place indexing in a wider social context beyond such mechanistic evaluation methods and to point towards new challenges for us as indexers.
منابع مشابه
A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine
Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...
متن کاملCompressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
متن کاملProbabilistic Latent Semantic Indexing Proceedings of the Twenty-Second Annual International SIGIR Conference on Research and Development in Information Retrieval
Probabilistic Latent Semantic Indexing is a novel approach to automated document indexing which is based on a statistical latent class model for factor analysis of count data. Fitted from a training corpus of text documents by a generalization of the Expectation Maximization algorithm, the utilized model is able to deal with domain{speci c synonymy as well as with polysemous words. In contrast ...
متن کاملModeling and Diagnosing Domain Knowledge Using Latent Semantic Indexing
A Latent Semantic Index (LSI) was constructed from arguments made by Navy officers concerning events in an Anti-Air Warfare scenario. A model based on LSI factor values predicted level of domain expertise with 89% accuracy. The LSI factor space was reduced using MDS to five dimensions: aircraft route, aircraft response, kinematics, localization, and an unclassifiable element. Arguments in the l...
متن کاملتأملاتی بر نمایه سازی تصاویر: یک تصویر ارزشی برابر با هزار واژه
Purpose: This paper presents various image indexing techniques and discusses their advantages and limitations. Methodology: conducting a review of the literature review, it identifies three main image indexing techniques, namely concept-based image indexing, content-based image indexing and folksonomy. It then describes each technique. Findings: Concept-based image indexing is te...
متن کامل